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Abstract 

The effect of defects on the melting profile of short heterogeneous DNA 
chains are calculated using the Peyrard-Bishop Hamiltonian. The on-site 
potential on a defect site is represented by a potential which has only the 
short-range repulsion and the flat part without well of the Morse potential. 
The stacking energy between the two neigbouring pairs involving a defect site 
is also modified. The results are found to be in good agreement with the 
experiments. 
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DNA thermal denaturation is the process of separating the two strands wound in a double 
helix into two single strands upon heating. Several experiments [|T],[| on dilute DNA solutions 
have provided evidence for the existence of a thermally driven melting transition correspond- 
ing to the sudden opening of base pairs at a critical(or melting) temperature T m . At T m 
one-half of the DNA denatured. The understanding of this remarkable one- dimensional co- 
operative phenomena in terms of standard statistical mechanics, i.e., a Hamiltonian model 
with temperature independent parameters is a subject of current interest • 

DNA is modelled as a quasi one dimensional lattice composed of N base pair units || . 
The double helix is held together by hydrogen bonds between complementary bases on 
opposite strands and hydrophobic or stacking interactions between nearest neighbor bases 
on opposite strands. Each base pair is in one of the two states; either open (non-hydrogen) 
or intact (hydrogen-bonded). A Hamiltonian model which has been found appropriate to 
represent interactions in DNA is the Peyrard-Bishop (PB) model The PB model is 

written as, 

H = £[f| + W (Vn> Vn+l) + VniVn)} (1) 

where m is the reduced mass of a base pair, y n denotes the stretching of the hydrogen bonds 
connecting the two bases of the n th pair and 

,dy n , 
Pn = m{— ) 

The on site potential V n (y n ) describes the interaction of the two bases of the n th pair. 
The Morse potential, 

V n (y n ) = D n (e^ - l) 2 (2) 

which is usually taken to represent the on-site interaction represents not only the H bonds 
connecting two bases belonging to opposite strands, but also the repulsive interactions of 
the phosphates and the surrounding solvents effects. The flat part at large values of the 
displacement of this potential emulates the tendency of the pair "melt" at high temperatures 



as thermal phonons drive the nucleotides outside the well (see Fig.l) and towards the flat 
portion of the potential. 

The stacking energy between the two neighboring base pairs is described by the anhar- 
monic potential 

W(y n ,y n+1 ) = ^[1 + p e- a ^ + y^](y n - y^f (3) 

The choice of this form of W(y n , y n -x) has been motivated by the observation that 
the stacking energy is not a property of individual bases but a character of the base pair, 
themselves [0] . When due to stretching the hydrogen bonds connecting the bases break, the 
electronic distribution on bases is modified causing the stacking interaction with adjacent 
bases to decrease. This is taken into account by the exponential term in Eq.(3). One may 
note that the effective coupling constant decreases from k(l + p) to k when either one of the 
two interacting base pairs is stretched. Eq.(3), therefore, takes care of changes in stacking 
energy due to breaking of hydrogen bonds in base pairs due to stretching. This decrease in 
coupling provides a large entropy in the denaturation. The parameter a in Eqs.(3) defines 
the " anharmonic range" . 

The model Hamiltonian of Eq.(l) has extensively been used to study the melting profile 
of a very long (N — > oo) and homogeneous DNA chain using both statistical mechanical 
calculations and the constrained temperature molecular dynamics [|K],|ll|]. Analytical inves- 
tigation of nonlinear dynamics of the model suggests that intrinsic energy localization can 
initiate the denaturation |12[. In case of long homogeneous DNA chain the model exhibits 
a peculiar type of first-order transition with finite "melting entropy", a discontinuity in the 
fraction of bound pairs and divergent correlation lengths. However, as the value of the 
stacking parameter a increases and the range of the "entropy barrier" becomes shorter than 
or comparable to the range of the Morse potential the transition changes to second order. 
The crossover is seen at a/a = 0.5 ||. 

When one considers chain having inhomogeneity in base pair sequence, one faces a prob- 
lem in investigating its statistical mechanics as the transfer-integral method which has been 
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used in the case of homogeneous chain is no longer valid. Attempts have, however, been made 
to use the model Hamiltonian of Eq.(l) for heterogeneous chains either by modelling the 
heterogeneity with quenched disorder || or by properly choosing basis sets of orthonormal 
functions for the kernels appearing in the expansion of the partition function |7]]. 

In this note we investigate the effect of defects on the melting profile of short DNA 
chains of heterogeneous compositions. A defect on DNA chain means a mismatched base- 
pair. For example, if one strand of DNA has adenine on a site the other strand has guanine 
or cytosine instead of thymine on the same site. In such a situation the pair will remain in 
open state at all temperatures as two nucleotides cannot join each other through hydrogen 
bonds. Oligonucleotide probes are commonly used to identify the presence of unrelated 
nucleic acids. In this context it is therefore important to discriminate the targets that differ 
from one another by a single or more nucleotides. 

Since the partition function of the PB model is convergent only in the limit of infinite 
number of base-pairs N (i.e. N — ► oo) [|], it is therefore, necessary in the case of short 
chains to consider not only the breaking of hydrogen bonds between single base pair, but 
also the complete dissociation of the two strands forming the double helix. In other words, in 
the case of short chains there is a need to consider thermal equilibrium between dissociated 
strands and associated double strands(the duplex) and a thermal equilibrium in the duplexes, 
between broken and unbroken interbase hydrogen bonds. 

The average fraction 9 of bonded base pairs can be factored as 9 = 9 ext 9 int 9 ext 



is the average fraction of strands forming duplexes, while 9i nt is the average fraction of 
unbroken bonds in the duplexes. The equilibrium dissociation of the duplex C 2 to single 
strand C\ may be represented by the relation C2 2C\ The dissociation equilibrium 

can be neglected in the case of long chains; while 9i nt and thus 9 go to zero when 9 ext is still 
practically 1. This is because in the case of long DNA fragments when 9 goes practically 
from 1 to at the melting transition, the two strands may not get completely separated; 
while most bonds are disrupted and the DNA has denatured, few bonds still remaining 
prevent the two strands from going apart each other. It is only at T >> T m there will be 



a real separation. Therefore, at the transition the double strand is always a single molecule 
and in calculation based on the PB model one has to calculate only 9 int . On the contrary, in 
the case of short chains the process of single bond disruption and strand dissociation tend 
to happen in the same temperature range; therefore, the computation of both Qi nt and 9 ex t 
is essential. 

For the computation of 9i nt one has to separate the configurations describing a double 
strand on the one hand, and dissociated single strand on the other. For this we follow the 



method suggested by Campa and Giansanti [[T3(]. The n bond is considered open if the 



value of y n is larger than a chosen threshold yo. A configuration belongs to the double 
strands if at least one of the y'^ is smaller than y . One can therefore define 9i nt for an N 
base pair duplexes by: 

i i=N 

fl«=vE(%-tt)) ( 4 ) 
iV i=i 

where "&{y) is Heaviside step function and the canonical average (.) is defined considering 
only the double strand configurations. For y , we have taken a value of 2 A. Since the PB 
model couples only the nearest neighbors the calculation of canonical averages in Eqs.(4) 
reduced to multiplication of finite matrices. The discretization of the coordinate variables 
and introduction of a proper cutoff on the maximum value of y' s Jl(| determines the size 



of the matrices and the number of base pairs in the chain the number of matrices to be 
multiplied. 

Since we are concerned with a heterogenous DNA chain, we have selected two different 
values of D n (see Eq. 2) according to the two possible base pairs; adenine-thymine (A- 
T) and guanine-cytosine(G-C). While A-T has the two hydrogen bonds, the G-C pair has 
three. Because of this D n for G-C bonds is chosen as nearly 1.5 times larger than the one 
representing to the A-T bonds. The complete set of values in our calculations are:D^T = 
0.05 eV, D GC = 0.075 eV, a AT = 4.2 A" 1 , a GC = 6.9, A" 1 k = 0.025 eV/A 2 , p = 
2, and a = 0.35 A" 1 

Since at a defect site the nucleotides of the two strands do not associate themselves 



through hydrogen bonds we replace the on-site Morse potential by a potential shown in 
Fig.l by full line. 
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Fig. 1 The on-site potential V(y) as a function of displacements. The dotted line represents 
the Morse potential (see Eqn.(2) and the full line the potential chosen to represent the interaction 
at the defect sites. 

This potential has repulsive part as well as the flat part of the Morse potential but not the 
well which arises due to hydrogen bonding interactions. Due to defect on a site the stacking 
interactions with adjacent bases will also be affected. As argued above, the formation of 
hydrogen bonds changes the electronic distribution on base pairs causing stronger stacking 
interactions with adjacent bases. Therefore, when one of the base pairs without hydrogen 
bonds is involved the stacking interaction will be weaker compared to the case when both 
base pairs are in intact state. This fact has been taken into account in our calculation by 
reducing the anharmonicity coefficient p to its half value wherever the defective site appeared 
in Eq.(3). 

For 9 ext we use the expression given in Ref. [13]. Thus, 



ext = 1 + 5- V5 2 + 25 



(5) 



where 




(6) 



2N Z(AB) 
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Here Z(A), Z(B) and Z(AB) are the configurational isothermal- isobaric partition func- 
tions of systems consisting of molecular species A, B and AB respectively. Nj is the number 
of molecules of species j in volume V and 2N = 2Nab + Na + Nb- In deriving Eq.(6) Na 
has been taken equal to Nb and 9 ext defined as 





"ext t>t 
iV 

The partition function for each species can be factored into internal and external part 



and can be written as [0 



Z(A)Z(B) Z int (A)Z int (B) (A)Z ext (B) 

2N Q Z(AB) a av Z int (AB) 2N Z ext (AB) [ ' 

where a av . = ^a AT a G c 

In analogy to what has been proposed for the Ising model || on the basis of partition 
function of rigid molecules fl6j , one makes the following choice 



aav.Z ext {A)Z ext (B) = n^ N - p e int + q / g x 
2N Z ext (AB) n 1 ' 

where n* is a chosen reference concentration as lfiM while n Q is the single strand con- 
centration which we have chosen as 3.1 fiM. p and q are the parameters which can be 
calculated using experimental results. 

We have considered the following two different oligonucleotides with sequence given by: 

(A) 5 ' GTGTT AACGTG AGT AT AGCGTy 

(B) 5 'GGT n GG 3 , 

and by the respective complementary strands when there is no defect. The melting profile 
of oligonucleotides (A) has been calculated by Campa and Giansanti Jl3|. They found very 
good agreement with experiment. We have extended their calculation and have studied the 
effect of defects on the melting profile. The results are plotted in Fig. (2). Our results for 



defectless chain agree very well with that given in refs. Jl3| and with experiments Jl4| 
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Fig. 2 Plot showing the variation of melting temperature with defect. Here p = 29.49 and 
q = 27.69 
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Fig. 3 Plot showing the differential melting curve 

We used the same parameters except the modification as stated above in the Morse 
potential and the anharmonic stacking potential term to calculate the melting profile with 
one defect on a site. Our calculation indicates that the melting profile of the chain does 
not depend on the location of the defect site. When we introduced two defects one at A th 
and the other at 9 th sites we found the melting curve shifts to further lower temperature. 
While we found the melting temperature T m decreases from 335.7 K to nearly 329.7 K (i.e. 
AT^ ' 1 ) ~ 6K) in the presence of one defect, the decrease in T m from one to two defects is 



8 



only AT^ 1,2) ~ 3K. These results are in good agreement with the experimental observations 
fl4| . We also put the two defects at the consecutive sites; i. e., at 8 th and 9 th sites. As Fig. 
2 shows there is a change in the melting profile compared to the case when the two defect 
sites were apart. This change is more clearly seen in the plot of d(f)/dT shown in Fig. 3. 

The oligonucleotides (B) has been studied by Bonnet et al ||15|| . They measured the 
change in the melting temperature with one defect placed on different sites of the chain. 
They found T m = 315K for defectless chain and AT m = 8K with one defect. Their results 
shows that the location of the defect on the chain has practically no effect on the melting 
temperature of the chain in agreement with our calculations. 
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Fig. 4 Plot showing the variation of melting temperature with defect. Here p = 34.46 and 
q = 32.45 
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Fig. 5 Plot showing differential melting curves 



We plot our results in Fig. 4. The force parameters are same as given in Eq.(5). Since 
the experimental conditions are slightly different than the one for chain (A) we adjusted 
parameters p and q so that melting profile for the defect less chain agrees with the experi- 
mental result. Using these values of the parameters we have calculated the melting profile 
with one and two defects. We find that while AT^ ~ 10 K and AT^ 1,2) ~ 3K. Fig. 5 
shows the plot of d(j)/dT. 

In conclusion we wish to emphasize that our calculations show that the PB model is 
capable of describing the melting profile of oligonucleotides of heterogeneous composition in 
the presence of defects also. 

This work was supported by the Department of Science and Technology, (India) through 
research grant. 
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